NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

On the Nyström Approximation for Preconditioning in Kernel Machines

Abedsoltan, Amirhesam; Pandit, Parthe; Rademacher, Luis; Belkin, Mikhail (May 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics)

Kernel methods are a popular class of nonlinear predictive models in machine learning. Scalable algorithms for learning kernel models need to be iterative in nature, but convergence can be slow due to poor conditioning. Spectral preconditioning is an important tool to speed-up the convergence of such iterative algorithms for training kernel models. However computing and storing a spectral preconditioner can be expensive which can lead to large computational and storage overheads, precluding the application of kernel methods to problems with large datasets. A Nystrom approximation of the spectral preconditioner is often cheaper to compute and store, and has demonstrated success in practical applications. In this paper we analyze the trade-offs of using such an approximated preconditioner. Specifically, we show that a sample of logarithmic size (as a function of the size of the dataset) enables the Nyström-based approximated preconditioner to accelerate gradient descent nearly as well as the exact preconditioner, while also reducing the computational and storage overheads.
more » « less
Full Text Available
On the Nyström Approximation for Preconditioning in Kernel Machines

Abedsoltan, Amirhesam; Pandit, Parthe; Rademacher, Luis; Belkin, Mikhail (May 2024, PMLR)

Full Text Available
On the Inconsistency of Kernel Ridgeless Regression in Fixed Dimensions

https://doi.org/10.1137/22M1499819

Beaglehole, Daniel; Belkin, Mikhail; Pandit, Parthe (December 2023, SIAM Journal on Mathematics of Data Science)

Full Text Available
Local Convergence of Gradient Descent-Ascent for Training Generative Adversarial Networks

https://doi.org/10.1109/IEEECONF59524.2023.10476957

Becker, Evan; Pandit, Parthe; Rangan, Sundeep; Fletcher, Alyson K (October 2023, IEEE)

Full Text Available
Matrix inference and estimation in multi-layer models*

https://doi.org/10.1088/1742-5468/ac3a75

Pandit, Parthe; Sahraee-Ardakan, Mojtaba; Rangan, Sundeep; Schniter, Philip; Fletcher, Alyson K (December 2021, Journal of Statistical Mechanics: Theory and Experiment)

Abstract We consider the problem of estimating the input and hidden variables of a stochastic multi-layer neural network (NN) from an observation of the output. The hidden variables in each layer are represented as matrices with statistical interactions along both rows as well as columns. This problem applies to matrix imputation, signal recovery via deep generative prior models, multi-task and mixed regression, and learning certain classes of two-layer NNs. We extend a recently-developed algorithm—multi-layer vector approximate message passing, for this matrix-valued inference problem. It is shown that the performance of the proposed multi-layer matrix vector approximate message passing algorithm can be exactly predicted in a certain random large-system limit, where the dimensions N × d of the unknown quantities grow as N → ∞ with d fixed. In the two-layer neural-network learning problem, this scaling corresponds to the case where the number of input features as well as training samples grow to infinity but the number of hidden nodes stays fixed. The analysis enables a precise prediction of the parameter and test error of the learning.
more » « less
Full Text Available
Generalized Autoregressive Linear Models for Discrete High-Dimensional Data

https://doi.org/10.1109/JSAIT.2020.3041714

Pandit, Parthe; Sahraee-Ardakan, Mojtaba; Amini, Arash A.; Rangan, Sundeep; Fletcher, Alyson K. (November 2020, IEEE Journal on Selected Areas in Information Theory)
null (Ed.)
Full Text Available
Inference With Deep Generative Priors in High Dimensions

https://doi.org/10.1109/JSAIT.2020.2986321

Pandit, Parthe; Sahraee-Ardakan, Mojtaba; Rangan, Sundeep; Schniter, Philip; Fletcher, Alyson K. (May 2020, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
Plug in estimation in high dimensional linear inverse problems a rigorous analysis

https://doi.org/10.1088/1742-5468/ab321a

Fletcher, Alyson K; Pandit, Parthe; Rangan, Sundeep; Sarkar, Subrata; Schniter, Philip (December 2019, Journal of Statistical Mechanics: Theory and Experiment)

Full Text Available
Generalization Error of Generalized Linear Models in High Dimensions

Emami, Melikasadat; Sahraee-Ardakan, Mojtaba; Pandit, Parthe; Rangan, Sundeep; Fletcher, Alyson K. (January 2020, International Conference on Machine Learning)
null (Ed.)
Full Text Available

Search for: All records